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Abstract 

We develop a gradient-flow framework based on the Wasserstein metric for a parabolic 
moving-boundary problem that models crystal dissolution and precipitation. In doing so we 
derive a new weak formulation for this moving-boundary problem and we show that this 
formulation is well-posed. In addition, we develop a new uniqueness technique based on the 
framework of gradient flows with respect to the Wasserstein metric. With this uniqueness 
technique, the Wasserstein framework becomes a complete well-posedness setting for this 
parabolic moving-boundary problem. 

1 Introduction 

1.1 Wasserstein formulations of parabolic PDEs 

Since the seminal work of Otto [15j [23] a large number of diffusive, parabolic partial differential 
equations has been cast in the form of a Wasserstein gradient flow. Besides linear and nonlinear 
diffusion equations [I] , also convection-diffusion equations [IS] , non-local equations [5] and higher- 
order parabolic equations [221 1131 HI] have been written as gradient flows with respect to the 
Wasserstein metric. Another extension is to abstract gradient flows in general metric spaces [Ij. 
While in many of the cases mentioned above the functional that plays the role of energy was 
already known, as a Lyapunov functional, the combination with the Wasserstein metric is a recent 
development and sheds new light on the problem. From a modelling point of view, the additional 
structure provided by the gradient flow has the effect of characterizing the system: once both 
the energy and the dissipation metric have been chosen, these choices fully determine the system, 
and leave no further room for variation. In this way the energy and the metric together clearly 
show the modelling choices that lie at the basis of the equations. The gradient-flow structure also 
suggests canonical time-discrete approximations, yields estimates, and provides a context in which 
approximations can be constructed which preserve these desirable properties. 

In this paper we extend the framework to a simple moving-boundary problem. We introduce 
a new weak formulation for this problem, and we exploit the Wasserstein framework to prove 
well-posedness of this formulation, including a contraction result in the Wasserstein metric. 

We hope that this paper will be interesting to two communities. To the reader who is famil- 
iar with the weak-solution approach to moving-boundary problems, we show how the Wasserstein 
gradient-flow context provides a thermodynamically meaningful and mathematically sound frame- 
work in which existence, uniqueness and stability results can be derived. The thermodynamic setup 
provides additional insight into the behaviour of solutions. For the reader who is familiar with 
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the Wasserstein gradient-flow formulation of parabolic evolution equations, we show how a class 
of moving-boundary problems can be included, and how the Wasserstein framework provides a 
convenient tool for proving well-posedness. 



1.2 Crystal dissolution and precipitation 

The problem that we consider in this paper arises in the modelling of crystal dissolution and 
precipitation [TTl HT] , and it represents a larger class of reactive solid-fluid interactions in which 
mass crosses the interface that separates the solid and fluid phases (see also [IHlllS]). The model 
defines two species, water and solute, which may occur in mixed form (in the fluid phase) or as 
pure phases (as in the solid phase, which consists only of crystalline solute). At the solid- fluid 
interface the solid crystal may precipitate or dissolve, thus moving the interface in one direction 
or another. The rate of dissolution or precipitation is given by a function of the local solute 
concentration in the fluid. 

We describe the concentrations of the two species by their volume fractions p (for water) 
and 1 — p (for solute). Note that this choice deviates from what is usual: the function p is 
the water volume fraction, not the solute concentration. The reason for this choice is that the 
problem takes a particularly simple form in terms of this variable: the solid crystal phase then 
corresponds to {x : p{t,x) = 0}, and the moving interface corresponds to boundary of the fluid 
region Q{t) — {x : p{t, x) > 0}. 

The water is assumed to evolve by diffusion within the fluid region fl{t); the boundary of 
this region itself evolves in time, with normal velocity w„, and the boundary condition for p is 
determined by the requirement that no water traverses the boundary. These assumptions lead to 
the equations 

dtp = Ap n{t), (la) 

1^ - ^pvn dn{t). (lb) 

Here n and w„ are the outward normal and the normal velocity of the boundary dfl; n points from 
the fluid into the solid phase. For simplicity we disregard the possibility of non-moving boundaries 
of ri, such as rigid walls. Note that p will typically have a non-zero limiting value at the inside of 
dfl, and we consider it to be zero outside of ft (in the solid). Therefore p is discontinuous at dil; 
the value of p appearing in (jlbp is the interior value. 

The evolution of the boundary is given by a kinetic evolution law, 

= /(p) dn{t), (ic) 

in which the rate function / characterizes the precipitation and dissolution processes. In the 
context of a two-phase, water-solute system, the water volume fraction p also characterizes the 
solute volume fraction 1 — p, and therefore we can consider the dissolution/precipitation rate / a 
function of p. Equations palllcl) form a closed system for 51 and p. 

Problem ([l} is related to the so-called Stefan problem with kinetic undercooling (see e.g. [3T] 
or the introduction of |10j for an overview). Both systems have a conserved quantity, the volume 
fraction p in ([T]) and the enthalpy in the Stefan problem; in both cases the interfacial velocity 
is prescribed explicitly. The two problems represent different cases, however, corresponding to 
different relative rates of bulk diffusion and interfacial reaction. In the Stefan problem, reaction 
is fast and the heat supply slow, leading to a continuous temperature and a rate Vn that depends 
on the local heat flux. In the problem ^ the diffusion and the reaction have similar rates, and 
the discontinuity in p (or in the chemical potential, to be precise) drives the interfacial movement. 

1.3 A Lyapunov functional 

Typical rate functions / are such that high solute values lead to precipitation, and low values to 
dissolution. We formalize this property by assuming that there exists a threshold volume fraction 
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a e (0, 1) such that 

f{p) ^0 ^ p^a. 
In this case the system ([T]) has (at least formally) a Lyapunov functional 



E{n,p) := / [p\ogp + a], 



(2) 



where it is understood that suppp C fJ. Indeed, 
d 



E{Vt{t),p{t))= i\ogp+l)dtp+ {p\ogp + a)vn 



n(t) 

/ (logp+l)Ap+ / {p\ogp + a)f{p) 
Jn{t) Jdfi{t) 

' ^ ^{logp+l)^ + {p\ogp + a)vn 



dn{t) 



n{t) P Jdn(t 



+ / {^P + a)f{p) 
n{t) P Jan(t) 



< 0. 



This derivation not only shows that E decreases along a solution, it also provides an expression 
for the rate of dissipation. Using ((Tc]) we write this rate as 



-Einit),p{t)) 



\Vp{^ 



o(t) 



a 



dn{t) 



fip) 



(3) 



The two terms have clearly recognizable origins: the first is associated with the diffusion of the bulk 
water, while the second arises from the precipation and dissolution reaction on the boundary 951. 
In the special case that f{p)/{p— a) is constant, the second term on the right-hand side depends 
only on w„. We show in this paper that for this case, and in one space dimension, the system ([1]) 
can be written as a gradient fiow of E with respect to the Wasserstein metric; this property will 
be the basis for the well-posedness results that we prove. (In Appendix |X] we also give a heuristic 
derivation of the gradient flow property). 

As an aside, we note that the functional E can be given a simple thermodynamic interpretation. 
We assume that heat is conducted rapidly with respect to other processes, and that the system 
is kept at constant temperature 9 by placing it in contact with a heat bath. In this situation the 
Helmholtz free energy. 



tp{p{t, x), (p{t, x)) dx, with ip{p, If) := e{p, ip) - es{p, ip), 



should decrease in time. In this formula e and s are the internal energy and the entropy of the 
system; p is a. phase indicator, which takes the value in the solid and 1 in the fluid. 
In view of the formula ([2]) we make the following choices for e and s: 



e{p,p) = Clip 



and 



siP,V) = -pl0gp + C2V5, 



where ci and C2 are constants. The coefficient ci is the latent heat that is absorbed upon dis- 
solution, and it should therefore be positive; C2 is an entropy penalty associated with the solid 
crystal, or an entropy advantage associated with dissolution, and should therefore also be positive. 
The function —plogp is the usual Gibbs-Boltzmann entropy for freely diffusing particles. Upon 
combining these choices we find that the Helmholtz free energy equals the functional E, where 
a = ci/6 - C2. 
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1.4 Weak solutions 



Solutions of ([T]) may lose regularity, for instance because of changes in topology of and 
therefore a concept of weak solution is necessary. To our knowledge, a weak formulation of 
problem ([1} is currently not known. Such a weak formulation should allow for lack of regularity 
both in p and in d^l{t), and this is where the difficulty lies. Since this aspect is central to our 
work, let us explain the issue in detail. 

The usual way of deriving a weak formulation proceeds by multiplying (jlap with a test function 
C e C^iM. X M") and integrating over R+ x R". We calculate 



ln{o) Jo Jn{t) Jo Jdn{t) 



and 

dp 



CAp - / / - / / ^^^P 

dn{t) on Jq 



When combining these with (jlap and (|lbp the boundary terms cancel, and we find 

/ / [p6 - VpV^] = - / p(0, x)e(0, x) dx for aU ^ e C^{R+ x M"). (4) 

Note that in the derivation of (|1]) we did not yet use the kinetic condition (fTcl) . Therefore the 
condition (Hj) applies for any given evolution of ^{t), or put differently, the condition (Hj) does not 
yet fix the evolution of il{t). One of the aims of this paper is to derive a weak formulation which 
captures the evolution of both p and f2, and for which existence and uniqueness results can be 
proved — at least in one dimension. 

1.5 The one-dimensional problem 

We now turn to the specific problem of this paper. We study a one-dimensional version of prob- 
lem ([1]), and we restrict ourselves to solutions whose support ft{t) is a single bounded interval 
[L{t), R{t)]. In addition we choose the specific rate function f{p) — [p — a)/ (3 (see Section [L7l 
for a discussion of this choice) . Under these conditions we derive below a weak formulation which 
encodes both the mass-conserving transport problem (|laj|lbp and the domain evolution law (jlcp . 
In this one-dimensional version of ([1]) we seek a triplet (p, L, R) satisfying 

Pt — Pxx, 
p{0,x) = po{x), 
p,{t,L{t))=-p{t,L{t))L'{t), 
pS,R{t))^-p{t,R{t))R'{t), 
a-p{t,L{t)) 



L'{t) 
R'it) = 



a-pit,R{t)) 

/3 



at each {t,x) with t > 0,x £ [L{t), R{t)], 


(5a) 


X e [Lo,Ro], 


(5b) 


t > 0, 


(5c) 


t > 0, 


(5d) 


t > 0. 


(5e) 


t > 0. 


(5f) 



Here a and /3 are two strictly positive constants, and subscripts denote differentiation. 
Starting from which in one dimension becomes 

[P^t - Px^x] = - / p(0, x)C(0, x) dx, 



10 JL{t) JL(0) 

we perform another partial integration on the second term to find 

(•R{t) POO pRiO) 

Ht+p^xx]- [p{t,R{mx{t,R(t))-p(t,L{mx{t,Lm = - / p{Q,x)ao,x)dx. 

L(t) Jo Jl(0) 

(6) 



4 



The trace of p on the boundary x — R(t) can be rewritten using the boundary condition ([5f|). 

pit, RimAt, R{t)) = ^R'mAt, m) + ^ut, m)- (7) 

and the middle term in this expression can again be rewritten by using the identity 

R{t)) = (tit, R{t)) + Ut, R{t))R'it). (8) 

Applying these remarks to © results in the following definition. By £(R+) we mean the set 
of Lebesgue measurable functions on R+; E is the energy defined above, i.e. 



E{p,L,R):^J p\ogp + a{R- L). 



IL 

Definition 1.1. (weak solutions) 

We call {p,L,R) G L°°(R+; ^^(M)) x £(M+) x £(M+) a weak solution to problem 0) if 
1. p > 0, and for all t > 0, L{t) < R{t) and suppp(t, •) C [L{t),R{t)]; 
2- £^max := esssup^-^aE{p{t),L{t),R{t)) < oo; 
3. for all^e C,°°(R2) 

poix)ao, x)dx - (3^0, Rm ~ 13^0, m) 

L(0) 



p{t,x)^t(t,x)dxdt- 13 Ut, R{t))dt - 13 ^t{t,L{t))dt = 

"'L(t) JO Jo 

R(,t) poo poo 

pit,x)^^^{t,x)dxdt-a ^^{t,R{t))dt + a £_^{t, L{t))dt. (9) 

L(t) Jo Jo 

Remark 1.2. Note that the weak form ^ now only requires integrability of p; no derivatives or 
trace values are used. Similarly, the domain limits L and R are not required to be continuous in 
time. Of course, higher-regularity results may give us much more regularity than this; for instance, 
the properties of the heat equation imply that the function p is smooth inside the domain, and 
Lemma 13.21 shows that solutions automatically have continuous boundaries. The main point here 
is however that the formulation itself does not require additional regularity. 



Remark 1.3. The second condition in Definition 11.11 is new to the discussion. For a classical 
solution of a gradient-flow system the energy decreases with time; for such a solution the global 
upper bound on the energy reduces to a finite-energy condition on the initial datum. Any gener- 
alization of the solution concept should preserve the monotonicity of the energy, and indeed the 
energy also decreases along solutions in the sense of Definition 11.11 

The energy bound plays a role in proving uniqueness of weak solutions. The superlinear 
control on p that follows from this bound provides continuity of the function t i— >■ p{t) in the 
topology a{L-^,L°°) (see Lemma \'3.2\i . and this additional regularity is just enough for the proof 
of contraction. 

Remark 1.4. Since we do not explicitly assume any regularity in time other than measurability 
of weak solutions, the proof of their uniqueness will be different from uniqueness proofs found 
in for instance [J. Indeed, the latter results consider uniqueness in the class of functions that 
are absolutely continuous in the time variable. Moreover, uniqueness is proven for solutions of 
the so-called Evolution Variational Inequality [H (4.0.13)], and not directly for the differential 
equation On the other hand, in [1] uniqueness results are obtained without the second 

assumption and with a slightly more general initial condition. Therefore, we cannot state that 
one result is stronger than the other. 

Remark 1.5. Weak solutions that are sufficiently regular also are solutions of ([1]) in the classical 
sense. 
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1.6 Main results and layout of the paper 



We start the next part of this paper by introducing some basic concepts and notation, most 
importantly the Wasserstcin distance and some of its properties. We then turn to the main result 
of this paper: 

Theorem 1.6. • Let Lq < Rg, po e L^(R), po > 0, and swpp po C [Lq,Ro]. Then there exists 
a weak solution {p,L,R) with initial datum [po^ Lq, Rq). 

• The solution operator is a contraction in the Wasserstcin metric. 

The contraction is proved as Theorem 13.11 in Section |31 As an intermediate result we derive 
an evolutionary variational inequality (|26p from which the contraction follows readily. The main 
ingredients for the proof of this evolutionary inequality are the Kantorovich duality formulation 
of the Wasserstcin distance and a reformulation of the weak-solution definition formula In 
Section 2] we use the well-known method of time discretization to prove the existence of weak 
solutions (Theorem Unj- We have chosen to include this proof for the specific case rather than to 
refer to a more general theory in order to be self-contained and accessible. However, in Section [SJ 
we will place the problem in the context of the general theory by Ambrosio, Gigli, and Savare 
of gradient flows in metric spaces fl^. Doing so, we easily obtain some extra results, such as an 
energy identity and regularity results. 

In Appendix |X] we provide a heuristic discussion that illustrates in a different way how the 
system of this paper can be viewed as a Wasserstcin gradient flow. This discussion also motivates 
the specific choice /(s) = (s — a)//3. 

1.7 Discussion 

Some of our results are not new: existence and uniqueness for classical solutions of a very similar 
one-dimensional problem has been proved by Van de Fliert and Van der Hout [TT], and general 
methods for Stefan-like problems can also be applied [T2]. The restriction to one dimension also 
allows for a formulation in terms of the variable U (x) := p, which is very similar to a classical 
Stefan problem [5H] . Interestingly, the convenient expression of the Wasserstein distance in terms 
of the inverse of this function U shows that this approach is actually related to that of this paper. 
On the other hand, other results are new, such as the weak formulation and its well-posedness, 
and the Wasserstein contraction property of the solution operator. 

In addition to proving new results, however, we also aim to demonstrate that this gradient- 
flow framework is a natural setting for this problem. This claim of 'naturalness' becomes apparent 
in a number of different places. To start with, the energy E has a meaningful thermodynamic 
interpretation. In fact one can also start with the energy E and the appropriate dissipation 
penalization (given by (j44p in Appendix |X| and derive the evolution equations ([T]) from these two 
choices. In addition, since no other gradient-flow frameworks are known for this problem, the mere 
existence of this structure is remarkable. 

The second indication of 'naturalness' arises in the fact that we prove existence, uniqueness, and 
stability of weak solutions all within the context of Wasserstein gradient flows. Only requiring very 
weak regularity of potential solutions, the necessary compactness properties follow from intrinsic 
properties of the energy E and the Wasserstein distance. 

Finally, a third indication of naturalness can be found in the derivation of the contraction 
property from the definition of a weak solution. Although slightly obscured by technicalities, the 
contraction theorem springs from a convenient combination of the test-function behaviour (j2ip 
with the Kantorovich duality characterization (llSp . Together these yield the central inequal- 
ity . In subsequent work we have developed this concept further . 

However, we have to accept two major limitations in order to make this scheme work: the 
specific choice of / and the single space dimension. These are fundamentally linked to both the 
existence of the weak solution and the use of the Wasserstein distance. 
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The rate function /. The choice f{p) = {p — a)/(3 initiahy arises as the only possibihty in 
the derivation of the weak formulation: the manipulation of the terms in ([THE]) requires exactly 
this form in order to be successful. Interestingly, the same choice is special in the dissipation 
balance ([3]): for any other choice of /, the trace of p on dfl appears in this equation. Whether the 
double appearance of this form of / is connected at some level is an intriguing question. 

The single space dimension. In this paper, movement of the boundaries L and R is 
penalized by adding two Dirac delta functions (with weight (3) to the function p. For such an aug- 
mented probability distribution the Wasserstein distance appropriately penalizes both movement 
in p and in the boundary positions. 

In higher dimensions the interfacial area will vary with time, and therefore simply weighting the 
interface with '/3 times the {n — l)-dimensional Hausdorff measure' is not possible: the Wasserstein 
distance between two measures of unequal mass is undefined. Modifications such as varying the 
weight /3 such that the total mass is constant, or transporting only the common part of the mass [7], 
appear to have unphysical effects. 

The central question is how to convert the dissipation tensor (j44p into a metric distance. The 
same dissipation tensor appears in the (formal) description of motion by mean curvature as a 
gradient flow of the area functional; in the context of this problem some steps have been made. 
Luckhaus introduced a time-discretisation of the Stefan problem with Gibbs-Thomson law [171 HI] ; 
Roger later modified this time-discrete problem by replacing global minimization by a form of 
local minimization |25| . Almgren, Taylor and Wang used a similar approximation in the context 
of crystal growth [5] . These time-discrete minimization problems have the formal structure of the 
common backward-Euler approximations of gradient fiows [3], and they appear to reduce, in the 
limit of small step size, to (|44)) . However, the rigorous definition of an associated metric distance 
still presents difficulties. 

In addition, the derivation of the weak formulation does not generalize from one to higher 
dimensions. The higher-dimensional equivalent of ([5]) reads 

^ / ^ds = [ [Ct + • nvn - ■ nvn] ds, 
Jdn{t) Jan(t) 

where H is the total curvature of the boundary. The additional curvature term prevents us from 
applying the one-dimensional arguments to higher dimensions, at least in the current form. 

In conclusion, in this paper we use the Wasserstein metric to represent not only the dissipation 
in the diffusing bulk but also the dissipation at the reacting boundary. This appears to be unique 
to one dimension and to a specific rate function. However, the formal structure of a gradient flow 
of E with respect to a dissipation metric of Wasserstein type — see Appendix |2] — is generally valid. 
This leaves some hope for generalizations of the scheme to higher dimensions as well and it will 
be an interesting challenge to place these generalizations on a rigorous footing. For now, we will 
carefully work out the scheme in the one-dimensional case. 



2 Energy and metric 

In this paper a 'measure' is always a non-negative Borel measure, and RM{M.) is the space of 
Radon measures, i.e. measures p such that p(R) < oo. Where necessary, the one-dimensional 
Lebesgue measure is denoted C 



2.1 The state space G 

The unknown in the deflnition of a weak solution above is a triplet {p,L,R). Throughout this 
paper it will be useful to bundle the three components into a single object, which is an element of 
the space 

G = |/i e RM{R) : p = pC + I3Sl+ I35r, p>0, J p=l, suppp c , 
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i.e. ii fieG, then for cj) e C(M) 

(j>d^i= I dx + l3(t>{L) + I3(I){R). 



We will write this identification as ^ — {p,L,R). Note that with this notation, applied both to 
the solution p and the initial datum po, the equation © that defines a weak solution can also be 
written as 



J my)dMy)- J S.t{t,y)dp{t-y)dt 



/ p{t,y)(yy{t,y)dydt-a I S,y{t,R{t)) dt + a j ^y{t, L{t)) dt. (10) 

-JLit) 



2.2 Energy 

As above the energy i? : G — > M is defined by 



Eip) / plogp + aiR-L). (11) 



where fi — (p, L, R). 

Lemma 2.1. (Energy bounds) For every p Cz G, 

E{p) > E'min := loga + 1, 
and if E{p) < M, then there exists M' — M'{M,a) such that 



Jp\ogp<M and R-L<M'. (12) 



Proof. Using Jensen's inequality we calculate 



E{p)^{R-L)[ ^Mi^i^d. + a(i? - i) 
Jl R - L 



>\ogl^j^^-^+a{R-L)>\oga + l. (13) 

where the last inequality is a property of the function x ^ log(l/a::) + ax. To prove we note 
that (fO)) implies that 

- log(i? -L) + a{R ~L)< M 
and since the function .x h- )• — log a; + ax is unbounded as a; ~> oo, the result follows. □ 
Corollary 2.2. For every weak solution p = {p,L,K), we have sup(^Q(i?(i) — L{t)) < oo. 

2.3 The Wasserstein distance 

We now turn to the Wasserstein metric. Let po and pi be measures on M satistfying /io(K) = /-ii(M). 
We say that a measure 7 has ^0 and pi as its marginals, if one of the following equivalent conditions 
holds: 

• For aU Borel sets A C M, 

-f[A X M] = po[A] and j[M. x A] = pi[A]. 
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For (^,V^)eLiJR)xLi^(R), 

[(p{x) +ip{y)]d'y{x,y) ^ I ip{x) dpo{x) + I ipiy) dfii{y). 



The set of all measures satisfying these conditions we call r(/io,Mi)- We say that a measurable 
function T : M — > R pushes fiQ forward to /ii, and we write /ii = T^fiQ, if 

• Hi[A] — p,o[T^^{A)] for all Borel sets A C M, or equivalently 

• for all ip e L^{pi), 

'fiiy) dp.iiy) ^ / (p{T{x))dfioix). 



If po and pi also satistfy x^dp^ix) < oo and y^dpi (y) < oo, then the Wasserstein distance 
is defined by 

W2(Aio,Aii)^ := inf </ {x - yfd-y^x^y) \ € r{pa, fJ-i) > ■ 

The measure 7 that achieves this infinium is called the optimal transport plan [29j . A useful 
characterization of optimality is the following [301 Theorem 5.9]: An admissible transport plan 
7 S T{pP,p^) is optimal iff supp7 is monotonia, i.e 

y{xi,yi), {x2,y2) G SUPP7, (xi - X2){yi - ^2) > 0. 

As a result we have the simple characterization for distances in G, 

Wi{p',p') = - L'f + Wi{p\ p') + f3{R" - R'f- (14) 

The distance W2 also has the dual representation [30^ Theorem 5.9] 

^W2(mo,A«i)' = sup \ I i^x^ -^{x))dpo{x)+ I -v*{y))dpiiy)\ . (15) 
^ ip&L^ipa) Ur Jr J 

Here ip* is the convex conjugate (the Legendre transform) of ip. If the optimal transport plan 7 
can be characterized in terms of an optimal transport map, i.e. a convex function ip such that 

C{x,y)d-f{x,y) ^ C{x,p' (x)) dpo{x) for any C G Cc(M^), 



then the supremum in (jl5p is reached at (p. 

Equipped with the metric W2 the set G is a metric space. Convergence in this space is 
equivalent to weak-* convergence of the measures together with a uniform bound on their support 
(see e.g. [HI Theorem 7.12]): 



W2(/in,/i) — >0 



/ 0(d/i„ - dp.) — > for aU e Gc(M), and 

SUp„ \Ln\, \Rn\ < 00. 



We shall also use 

Lemma 2.3. The energy E is lower- semicontinuous in the Wasserstein metric. 

Proof. Let W2{pmP) — t- as n ^ 0. By (fT4)) this implies that each of ]L„ — L\, ]i?„ — R\, and 
W2{pm p) vanishes. The term R — L in pip then converges, and by the lower-semicontinuity of 
p H> / plogp (e.g. OS]), we have 



plogp < lim inf / p„ \ogpn. 

□ 
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2.4 Regular elements of G 

An important way of regularizing elements of G is to replace them by approximations for which p 
is bounded from above and away from zero on [L, R\. 

Definition 2.4. An element pL — {p, L, R) £ G is called regular if p € L°°(R) and essinfj^ p > 
0. 

If one end point is regular, the Wasserstein distance can be characterized in a particularly 
useful way. 

Lemma 2.5. Let p^ , p^ £ G and assume that p} is regular. Then there exists a convex function 
V e W^^l(R) such that 

= and = (16) 

and 

\W^{p\p') = j{\x^-^{x))dp'{x)+ jCjy^-^*{y))dp\y). (17) 

The proof is given in the appendix. Using this characterization we exploit the displacement 
convexity [IQI of E to prove 

Lemma 2.6. Let p^\p^ £ G and let p^ be regular. Then 

E{p^) - Eip"^) > / (1 - ip"ix))p'>{x) dx + aiR^ - R^^ - L^ + L°), (18) 

where (p is the function given by the previous lemma. Here J <~p" p^ should be interpreted as +cxd if 
tf"p^ is not integrable. 

Proof. The displacement convexity of p i-> j p log p gives 

j pHogp'-J p"logpO> j{l-^"{x))p\x)dx. 

This follows for instance from taking the function tp in (1361) equal io x ^ x ~ p'{x). The second 
term on the right-hand side of ([T5)) is immediate. □ 



3 Contraction 

In this section we show that weak solutions satisfy a contraction property: the Wasserstein distance 
between two solutions decreases in time. The fact that the energy E is displacement convex 
suggests such a contraction property (see e.g. [531 13]). The proof that we give is special in that it 
only requires the relatively weak solution concept of Definition ll.il In other words, the contraction 
properties are derived from the weak form of the differential equation itself. In particular, we have 
assumed no regularity in time for such solutions other than measurability and in this respect the 
following treatment differs from a typical treatment found in [3]. 

Tlieorem 3.1. Let p^ — {p^,L^,R^) and p^ ~ {p^,L^,R^) be two weak solutions. Then the 
function 

t^W2{p''{t),pHt)) 

is non-increasing for t £ [0, oo). 

We start with some preliminary lemmas. 
Lemma 3.2. Let p = (p, L, R) be a weak solution. 
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1. Then 

<-R{t) r-Ro 

p{t,x)dx = / po{x)dx, for almost all t > 0, (19) 
L{t) Jlo 

and 

xd^{t;x)= J xdfj,{0;x), for almost all t > 0. (20) 

Moreover, there exists a constant Umax such that \R(t)\, \L(t)\ < D-n^^y^. 

2. The function 

t ^ p.{t) 

(which is defined for almost every t > 0) has a representative which is W2-continuous on 
[0,cxd); the absolutely continuous part 1 1— > p{t) is a [L^ (M.), L°°{M.))- continuous on [0, oo). In 
addition L,R& C([0, oo)). 

3. For any C, G W^^^iM.) with sup,j.^^C" i^) < cxD; th,e function 

hit) := J Cix) dfi{t;x) 

satisfies for every t > 

lim < Z"^^*^ C'ix)p{t, x) dx + aC{L{t)) - aC'{R{t)). (21) 

eiO £ Jut) 

Proof. To start, we set ^{t,x) := ■ip{t)C{x) in with V e C^{{0,oo)) and ( e ^^(K). Then 
i}:'{t) C{x)d^i(t;x)dt^ / il){t){ / C'\x)p{t,x)dx + aC'{L(t))-aC'{R{t)))dt. 

J Jo [JL{t} J 

(22) 

Equahties (IT^ and ([20]) foUow from specific choices for C,. Choosing the sequence Cei^) ■= Ci^^) 
with C(0) = 1 and taking the limit e — yields p^: choosing the sequence Cei^) ■= Ci^^)/^ 
with C(0) = 0, C'(0) = 1 yields I^U^. This latter inequality, combined with the bound on i? — L 
fCorollarv l2.2p . gives the separate boundedness of L and R. 

For the second part, choose a representative for the function 1 1— > fi{t) (again denoted p). For 
any C S C'^(R) and the expression in curly braces in ([2^ is bounded uniformly in t; therefore 
hi^ G W^'°°{0,oo). Therefore there is a continuous representative (also denoted h(^) such that for 
t G (0,oo)\A^^, with iV^ a nuh set. 



hdt) - / Cix)dpit,x), (23) 

and 

/i' (t) = / C"ix)p{t, x) dx + aC{L{t)) - aC{R{t)). (24) 



Let D C C^(M) be a countable dense subset of Cc(M). Then we can define a null set N such 
that (Uni) holds for every ( & D and every t e (0, oo)\7V. We claim that if {tn}neN C (0, oo)\N 
with i„ — then p{tn) W^2-converges to a limit, and that this limit is independent of the choice 
of sequence (but depends, of course, on to). Assuming this for the moment, we can extend p to a 
VF2-continuous function on (0,cx)) by taking limits. 

To prove the claim above, first assume two time sequences both converge to to > 0, such that 
p converges in W2 to different limits pi and p2 ■ We then choose ( & D such that / (dpi ^ J (dp2 
this contradicts the continuity of Hq proved above. This shows that limits are independent of the 
sequence. Since the measures {p{t)}t£(o,oo)\N ^re tight, sequences {p{tn)} are PF2-compact, and 
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therefore contain a convergent subsequence; by the uniqueness proved above the whole sequence 
converges. 

To prove the cr(L^, L°°)-continuity we use a similar argument. By part[2]of Definition 11.11 anv 
sequence {p{tn)) is precompact in the (t{L^ ,L°°) topology (see e.g. [HllllS]); the uniqueness of the 
limit follows from the uniqueness of the W2-liniit. This concludes part [5] of the Lemma. 

For part H we fix the time t. Note that by jM]) for C e C^(R) and fixed e > 0, 

e Jt-e [J Lit) I 



'{x)~f p{s,x)dsdx + a-f {C{L{s)) - ('{Ris))) ds (25) 

Jt-s Jt-e 



For general ^ S Wj^'^^(]R) we smoothly truncate ( to zero outside of [— i^max^ 1, ^max+1] and define 
Cn ■— V'n * Ci where {ipn} is a sequence of moUifiers. Then h(^^{t) — > hi^{t), hQ^{t — e) — > /i^(< — e), 

— >■ C" in L^, and C'j (' uniformly on M. Because of the boundedness of the support of p, the 
terms in (j25p are independent of the choice of truncation. 

Next take the limit e | 0. By the continuity of L and , 



/ C{Lis))ds^CiL{t)). 



lim 



Since for any m > we have 

/ C{^)~f p{s,x)dsdx< / max{C"(a;), — to}4 p{s,x)dsdx, 

Jr Jt-e Jr Jt-e 

and since the function x H> max{C"(a;), — to} G L°°(R), it follows from the CT(i^, i°°)-continuity 
that 



^"(x)4 p{s,x)dsdx< / max{(" {x), '-m}p{t, x) dx. 
Jt-e Jr 

The left-hand side is independent of m, and therefore by the monotone convergence theorem 



lim / C" 

eiO,' 



lim / C"{x)J p{s,x)dsdx< [ C' {x)p{t, x) dx. 

£■1-0 Jr 

Collecting the parts yields the assertion. □ 

Lemma 3.3. Let p'^ : [0, oo) G he a weak solution and let p^ (£ G be arbitrary. Then the 
function t W2 {p'^ (t) , p^) satisfies for all p^ G G the evolution variational inequality 

ljWi{p'{t),p') + Eip'it))<E{p'), (26) 

in distributional sense. 

Proof. We will show that /i° satisfies 

^Wi{p"{t2),p') - iM/|(/(ii),/) < {t2-t,)E{p^) - f\{p\s))ds 

Jti 

for all < tl < ^2 and all p^ e G. (27) 

First, we show that it is sufficient to prove (|27p under the additional assumption that p^ is 
regular. Suppose that (l27l) has been proved for all regular p^. Approximate a general /x^ G G by 
regular pl^, such that 

W2 {p\pl,)^0 and pI p^ in (M) . 
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This can be done, for instance, by restricting on [L^,R^] to the value set and then 

renormalizing the result to regain the same mass as . Then for all t > 0, 

W2{pi\t),p^)= yyuvW2{pi\t),pl) and E{p^) ^ Yiuv E{pl). 

Therefore we can pass to the limit in (j27p for p\ and obtain the same inequality for p^ . 

We therefore continue under the assumption that p^ is regular. Lemma 12.51 provides a function 
ip{t, ■) S Wjq^^(R) that satisfies the equality (fT7|) at time t; by using the same function ip{t, ■) in 
the characterization (jlSp for time t — e, we find 

\Wi{p\t),p^) - ^Wi{p°{t - e),p') < I - ^{t, x)){dp'>{t; x) ~ dp\t - e; x)). 

Then 

limsupf ^l{p\t),p^) - Wl{p\t - 

< limsup - / {\x^ — (p{t, x)){dp^{t] x) — dp^{t — e; x)) 

< {l~^^,{t,x))p''{t,x)dx + a L\t)~^,{t,L''{t))-R\t)+^,{t,R''{t)) 



^ ip^^it, x)dx + a L%t) -L^- + i?i 



< E{p^)~E{p'^{t)). (28) 



Therefore the function 

fit) := ^Wi{p°{t),p^) + f[E{p\s)) - E{p')] ds 

JQ 

is continuous and has a non-positive left derivative at every i > 0; by an application of the next 
lemma it follows that / is decreasing, which is a reformulation of ([27]). □ 

Lemma 3.4. Let f e C(M) satisfy 

limsup - f{t - e)) < 0, for all t e M, 

Then f is non-increasing. 

Proof. Assume, to force a contradiction, that there exist ti < t2 with 

t2-ti 

Define 

git) ■.^fit)-fit,)-£it-t,). 

The function g is continuous, and therefore takes its maximum over the interval [ii,t2] at some 
ts G [^1,^2]- Note that giti) = git2) = 0, so that we can assume that <3 > ti. Since g is maximal 
at is, 

Iiminfi(5(t3)-ff(i3-e)) >0. 

On the other hand, 

limsup ^(5(^3) - 5(i3 -£)) = -^ + limsup i(/(i3)- /(is- £)) < 

eiO £ £4,0 £ 

which implies a contradiction. □ 
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Proof of Theorem \3.1l Defining 

d{s,t) lwi{p''{s),^iHt)) for s,t>0, 

the proof of the theorem is concluded by showing that d/ dt d{t, < in the sense of distributions. 
This is done by an argument of [H Lemma 4.3.4], which we reproduce here for completeness. 
Let C S C^((0, oo)), C ^ 0, and calculate for sufficiently small e > 

' dit, cwniii^ dt = f m'^'^'^-'^'-'^'-'-^dt 

° 'cw^^M^ifc^rf^+ rat+e)'-^^^^±^l^^dt. 



Now note that by (P7|) we can deduce 

- d{t,t)^^^^^^^^^dt< J\{t)^(^sE{^lHt))~ J' E{n"{s))ds^ dt 

+ ^ C{t + s)^ (^eE{^l"{t)) - jj^ E{^l\s)) ds^ dt. 

The integrands at the right hand side are bounded from above. Moreover, by the M^2-continuity 
of /x"'^ there exists a constant C > such that 

< C for all t, {t — e) E suppC and e sufficiently small. 

The dominated convergence theorem, Fatou's Lemma, and the fact that E is lower semi-continuous 
with respect to the Wasserstein metric fLemma l2.3p . yield 

d{t,t)C{t)dt< [ C{t)limsup(E{n\t))~i E{p° (s)) ds)] dt 

Jo eiO V Jt-e J 

' rt+E \ 

E{fi"{t)) E{fi\s))dsj dt 

< I at){E{^^'{t))~E{^l^{t)) + E{p^{t))~Ei^,\t)))dt<o. 

□ 



lim sup 



4 Existence of solutions 

The proof of existence of a solution follows the well-known argument of time-discretization. Similar 
proofs in the Wasserstein context have been given in [21j |23i |lj . We provide the proof here for 
completeness. 

Theorem 4.1. Let fiQ € G. Then there exists a weak solution of problem 0) with initial datum 
fiQ as defined in Definition \l.l\ 

The main steps in the proof of Theorem 14 . II are 
Step 1 (Time-discrete minimization problem). Let h > and cr e G be fixed, and define 

IhA^^) = ^wi{a,^l) + E{^l), ^ov^,eG. (29) 

Lemma 4.2. There exists a unique minimizer /i — {p,L,R) G G of Ih.a- It satisfies the Euler- 
Lagrange equation 



\ I C{x)d{p{x)~(j{x))~ I p{x)C"{x)dx + aC\R)~aC{L) 



h 

for every C, E 



<^K"\\ooWi{a,p) (30) 
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step 2 (Approximate solutions). For h > fixed, and for given initiai datum jiQ E G, 
construct tlie sequence (/i^)i>o by 

/i'*^]^ minimizes I^ ^^h, and /ip ■— Mo- 
Define tlie approximate soiution /i'' : [0, oo) G by 

n''(t):=n'l if te[ih,{i + l)h). 
Lemma 4.3. For all C e C^^{R x R), a«rf T such that C,{t,x) = whenever t > T, 

d^({t,x)dn''{t;x)dt - ^ J J ({t, x)dno{x) dt 

p'\t,x)C^^{t,x)dxdt + a I Cx{t,R''{t))dt-a [ Cx{t, L^'{t)) dt 



<x||C.x||oo E W^I(Mtl,M.'), (31) 



where the discrete derivative is defined by 

d^c{t,x) ,^ cit + h,x)^~at,x) ^ 

step 3 (Estimating the error term). 
Lemma 4.4. For all h > 0, 

IT/h] 

E W^iti1_,,ti1)<2hiE{^o)~loga^l). (32) 

i=l 

Hence the right-hand side in approaches zero as h ^ 0. 
Step 4 (Convergence). 

Lemma 4.5. There exists a sequence hj — > and a uniformly W2- continuous function fjP : 
[0, 00) — >■ G such that for all T > 0, 

sup W2{n^'{t),n°{t)) ^0 as j 00. (33) 

te[o,T] 

Step 5 (Conclusion). 

Lemma 4.6. The limit fjP is a weak solution. 
We now proceed to prove these lemmas. 

4.1 Proof of Lemma 14.21 

We first state and prove an auxiliary lemma. 
Lemma 4.7. For any a E G and M > the set 

G^' := {m e G : E{ii) < M and W2{a, n) < M} 
is compact with respect to the metric W2- 
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Proof. Since the bound on W2{a', fi) implies bounds on the support, G^^ is tight and therefore 
precompact with respect to measure convergence; the same bounds on the support also imply that 
second moments of elements in G*^ are uniformly bounded, and therefore G^^ is precompact with 
respect to W2- 

We next show that a limit point /i of G*^ has the same structure as G itself, i.e. that /i can 
be written as 

^i^pC + PSl + PSr, (34) 

for some p G L^(]R) and L,R £R, with supp p C [L, R]. Taking a sequence ^" = (p", L", i?") G 
, the left and right boundaries converge since by (fT4)) 

by which (-L")„ and are Cauchy sequences with limits (say) L and _R. 

As in the proof of Lemma 13.21 the bound on / p"logp" implies (t(L^, L°°)-precompactness, so 
that there is a subsequence that converges weakly in the (t(L^,L°°) topology to a limit p. This 
convergence implies VF2-convergence of p", since the bounds on L" and i?" imply a bound on 
the second moment of p". Combining this with the convergence of the left and right boundaries 
implies that p satisfies ([M)) . The fact that supp p G [L,R] is a direct consequence of the measure 
convergence. 

Finally, the lower semincontinuity of E (Lemma 12. 3|) implies that any limit point p of G^^ 
satisfies E{p) < M; therefore G*^ is W2-closed. □ 

By this Lemma the Vl^2-lower semicontinuous functional Ih^a- has a global minimizer po in G. 
We now show that satisfies ([50]) . We construct a perturbation by the method introduced in [TS] 
(see also [13] or [T]). We choose ip G C^{R) and define the flow map ($£)eeR in G°°(M) by the 
pointwise differential equation in e. 




Id. 



Then $e is a G^-diffeomorphism for all e G R, and 



d 5$e(a;) 



ds dx 



e=0 



dx de 



E=0 



e=0 



Next we define the probability measure p^ :— ($£)#/io. Since is monotonic, //^ G G. 

Let p^ = {p^,Le,Re) and ct = {p,L,R). Then = (*£)#Po, = *e(-^o), and R^ = «'e(^o)- 
Because //o minimizes Ih^a we have 



Ih,aiPe) - Ih,a[po) 



> 0. 



(35) 



Following the calculations in any of [121 [231 [I] we have 

1 d 



~Wiip.,p) 



:=0 



{x - y)ip{x)dj{x,y), 



where 7 is the optimal measure in T(pQ,p). Combining this with (|14p we obtain 



1 d 
2de 



Wi{p„<j] 



e=0 



(x - y)^ix)d^ix, y) + /3(i?o - i?) ^^^e{Ra] 



e=0 



de 



{x - y)2P{x)d^{x, y) + /3(i?o - i?)^(i?o) + /3(io - i)^(io) 
(a; - y)ip{x)d'-f{x,y). 



e=0 
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Here 7 is the optimal measure in r(/xo, cr). 

Furthermore, again following |15] or [1], we have 



e=0 



Finally, 



d_ 

de 
d_ 

de 



aRe 



£=0 



e=0 



po{x)ip' {x)dx. 



aip{Ro), 
-aip{Lo). 



(36) 



Hence, if we let e approach in (|35|) . we obtain 
1 



{x - y)%p{x)d^{x,y) - / po(a;)V''(a;) c^a^ + aV'(^o) - aV'(io) > 0. 



Since V' is arbitrary, actually equality holds. Now take C e C^(R) and set = C,' . We find 
1 



{x-y)C\x)d^{x,y)^ / po(x)C"(a;)dx + aC'(^o)-<(io) = 0. 



We use that 



and that 



C{y) = C{x) + {y- x)C{x) + -{y ~ x)\"{9,yX + (1 - 0,y)y). 



where O^y e [0, 1], to find (jSO 



4.2 Proof of Lemma [4.31 



This Lemma follows from a simple rearrangement of the discrete time-derivative terms. Let 
f e C;f (R X R), and let T be such that ^{t, x) = for all t > T and xGR. Assume, without loss 
of generality, that T/h G N. By for aU t G (0,T] and i such that t G [ti-i,ti), 



^Jjit,x)difi':ix)-fil^ix))^J^p'^{x)Uit,x)dx + aUt,R'l)~(^Ut,L^^ 
If we integrate this inequality over t G (0, T) and use the triangle inequality, we obtain 



i=l V'^ "'K -'K 



T/h 



<i||e..||ooEw^2(A^ti,A^.')- (37) 



Since 

T/h 

E 

4 = 1 "^'-1 



^ f(t,a;)d(M!'(a;)-/iti(x)) / / a;)^^^^;)^^ - - 



^(t, x)dpo{x)dt, 



we deduce ([51]) from ([57]). 
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4.3 Proof of Lemma [4.41 

For each j e N, /ij* minimizes ^ , so that 

or 

We use a telescoping-sum argument and Lemma |2. II to see that 

T/h 
i=l 

4.4 Compactness and proof of Lemma 14.51 

The Arzela-Ascoh theorem is based on a combination of equicontinuity in time with compactness 
of time sUces. In the case at hand, the functions t fJ.^{t) are not continuous, but satisfy a weaker 
condition that is expressed in the foUowing lemma. 

Lemma 4.8. There exists a constant C , depending only on pQ, Lq, Rq and a, such that for all 
h> and ^2 > > 0, 

W2 {p''it2),fi'\t,)) < CVt2 -ti+h. (38) 

One might think of (j38p as an 'equi-near-continuity' condition; for finite h > jumps are 
possible, but the maximal size of the jumps converges to zero as /i — > 0. 
Since Lemma 14.81 also provides a bound 



sup W2{p''it), Ho) < CVtTT, uniformly inke (0, 1], 



the set { Ai^(t) : he {0, 1], t e [0, T] } is a subset of G*^ with M = ma.x{E{fio),CVTTT} , and 
therefore precompact in W2 by Lemma 14.71 With this compactness and Lemma 14.81 the proof of 
the Arzela-Ascoli theorem is easily adapted to yield the existence of a sequence hj and a limit 
/i° : [0, 00) ^ G such that for any T > 0, 



sup W2{p'''{t),n°{t)) ->0 asj^-oo, 

tG[0,T] 



thereby proving Lemma 14.51 
Proof of Lemma \4-8\ Define 



ii := [ti/h\, 12 := [t2/h\, 

so that 

«2 - «i < — r ^ 1- 

h 

Applying the triangle inequality, the Cauchy-Schwarz inequality, and the bound on the difference 
in internal energies that was also used in the previous theorem, we find 



W^2(M"(i2),/(ii)) - W2{^^l,^i1J 



j=ii+l 

1/2 

< ^^t2-t, + h[2h {E{pI , ,Rlj- E{pI , Ll , i?,^ ))] 

= C^t2 -ti+h. 
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with 

C := V2^E{po,Lo,Ra)~loga + l. 

□ 

4.5 Proof of Lemma 14.61 

To show that the hmit jjP is a weak sohition, we first note that part [1] of Definition 11.11 is satisfied 
by the definition of G and the W2-continuity of /i", and that part [5] foUows from the bound 

E{tJ°{t)) < hminf £;(^''^ (t)) < E{fio). 

To show (|n]), we first note that from ([55)1 and (ITil) follows that for any T > 

fi''' {t) fi°(t) and p''^ {t) p°{t) as measures, for aU t € [0, T] 

L'h j^h^^j^o^j^o in i°°([0,T]). 

With this convergence, and the estimate (1321) . we take the limit in (|3T|) to find ([9]). 

5 Alternative proof 

Our proof of Theorem 14.11 was largely self-consistent. Yet, we see advantages in putting our work 
in the framework of the book on gradient fiows by Ambrosio, Gigli and Savare [4], to which we 
will refer in this section. Because the functional E is lower-semicontinuous (see Lemma |2.3|) . and 
PF2-bounded subsets of of a sublevel of E are relatively compact fLemma 14.71) . Assumptions 2.1a 
and 2.1c from [4^ on the functional are fulfilled, while Property 2.1b is immediate. Since the 
energy E is geodesically convex, 4, Assumption 2.4.5] is fulfilled as well. Consequently, we may 
conclude that our discretisation scheme yields the so-called curve of maximal slope /i : R+ — > G 
with respect to the strong upper gradient of our energy functional E (see [H Corollary 2.4.11]). 
Due to the geodesic convexity of E, this strong upper gradient in a point fj, € G equals both the 
local and the global slope of E in /i, that are defined by (see also [H Definition 1.2.4]) 

\dE\{p) := hmsup ^ , \ , Te{p) sup ^ . ■ 

In the following proposition we will compute the strong upper gradient of the internal energy E. 
Proposition 5.1. Let ji — {p,L,R) E G. Then 




P 
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Proof. Let /i = {p,L,R) E G. We will first bound Te{ij) from below by estimating 



Te(^) = sup 



(E{p) - E{v))+ 



> sup - 

Tec}- 

>sup ^('^)-^(^#^) 



sup sup 

Tec^ o<t< 



sup sup 

Teci 









- E{Tt#ii) 



where Tt{x) := (1 — t)x + tT{x). Furthermore 

Ejfi) - E{Tt#^,) ^ E{^l)-E{T,^^^) _ 
o<t<i i *4-0 t 

The limit on the right-hand side equals 

lim ^(^) ~ EiTt#^l) ^ I _ ^^^^^^ _ a{T{R) - R) + a[T{L) - L). 

Consequently, 

/(r (x) - - a{T{R) -R) + a{T{L) - L) 

TeW > sup , 

Tecl ^Jix-T{xWdf,{x) 

JS'ix)pix)dx~aSiR)+aSiL) 
= sup ^ — . (39) 

seci \\j\\L2{fi) 

Suppose this supremum is finite. It follows that the operator — > R given by 

[ S'{x)p{x)dx-aS{R)+aS{L) 



can be extended to a bounded operator (/i) — >■ M, which in turn can be represented by an (p) 
inner product with a vector. From this it follows that p'/p E L'^{p). We therefore have proved 
that if p'/p ^ L'^ip), then Te{p) = oo. 

We now continue under the assumption that the supremum p9p is finite, and therefore that 
p'/p G L^{p)- Note that therefore the boundary values p{R) and p{L) are well-defined. We 
integrate by parts and apply Cauchy-Schwarz to find 

-IS{x)!^pix)dx-SiR)ia~piR))~SiL)ipiL)-a) 
Te{h) > sup ^ 

seel 



^,i(p(L)-a),i(a-p(i?)) 



(40) 



For the upper bound, we note that for each v and e > there exist that have smooth, 
strictly positive densities and \E{i') — E{i/'^)\ < e and W2{iy, i^^) < e. Therefore, for each v G G, 

(Eip) - E{u))+ ^ {E{p) - E{u-))+ 
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Because of the regularity of the v"^ , it holds that the optimal maps such that ^ — are 
in Cl- We define Tf{x) := (1 - t)x + T''^ (x). Due to displacement convexity, if E{v) < E{^), 

{Ei„) - E{u))+ E{p)^EiiTrhiyn 

JiiT^'yix) - l)p{x)dx - a{T''\R) ~ R) + a{T''' {L) - L) 



sup 

e>0 



Therefore, 

'^e{p) < sup 



^j{T'^^{x)~xfd^l{x) 
J {T'{x) - l){x)p{x)dx - a{T{R) - R) + a{T{L) - L) 



Ted 



^J{x - nx)Ydp{x) 

Assuming p' / p G L^(p) we can partially integrate and apply Cauchy-Schwarz as before to get 
TEip) < 



^,^{piL)-a),^ia-pm 



□ 



Having the expression for the strong upper gradient at hand. Theorem 2.3.3 in f4^ provides us 
with the energy identity 

i \p'Wt)dt + \dE\\p{t))dt + E{p{T)) = E{po), for ah T > 0. 

Here, the metric derivative is for almost every t defined by 

s^t \s — t\ 

and it is an element of L^(0,r). At the same time, \p'\ = \dE\ at almost every t. Therefore, the 
energy identity gives rigorous sense to the formal result for the dissipation rate ([3]) we established 
in the introduction. Finally, the regularizing effect as described in [3J Theorem 2.4.15] implies 
that the solution satisfies \dE\{p{t)) < oo for all t > 0. In particular, this means that 
p^{t; x) / p{t; x) e L\p{t)) for aU t > 0. 

The techniques used in Chapter 2 of [4 to prove existence of minimizing movements are very 
similar, yet more general than the techniques we have used to prove existence. In Chapter 4 
of the book, under the more stringent Assumption 4.0.1, it is shown that even somewhat better 
results can be obtained. Because in the one-dimensional case, for every measure p with finite 
second moment the function v ^ W|(/i,z^) is 1-convex along geodesies U (9.1.7)], Assumption 
4.0.1 follows immediately for the case at hand. This enables us to apply [J, Theorem 4.0.4]. In 
the existence proof, one can use a slightly more general initial condition, that the initial datum is 
in the closure of the domain of E. Moreover, the regularity result becomes stronger with actual 
bounds on \dE\{p). 

Another nice result of [U Theorem 4.0.4] is that the solution curve p : G is the unique 

solution of the evolution variational inequality (see 4, (4.0.13)] with A = 0) 

i^VF|(^(t),CT) < E{a) - E{p{t)), for a.e. t > 0, for all ct G G, (41) 

among all absolutely continuous curves such that linit^o p(t) = po. We will now show that such a 
solution is a weak solution in the sense of Definition 11.11 as well. Theorem 8.3.1 of [1] yields the 
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existence of a vector field v(t, •) such that the following formulation of the continuity equation is 
fulfilled: 




{Ct{t,x)+v{t,x)CS,x))dfi{t;x)dt:^0, for aU C e C;?°(R+ X M). (42) 



Expressed in this vector field, the derivative in equation (pij) reads 

2dt^'(A^W,'^) = J^v{t,x){x-T'^{t,x))dfi{t), 

where T°'{t, •) : R — )■ M is the map that pushes fi{t) forward to <t. We take an arbitrary function 
C G C^(R X R+) and we introduce the fiow maps 

$0 = Id. 

Subsequently, we define cr' :— ^l^^J■t, and we substitute this for a in the evolutional variational 
inequality (HJ). By dividing by e and letting e — > (compare Section H?T|) . we derive 

- / v{t,x)Cx{t,x)d^{t) ^ - / p{t,x)Ca:xit,x)+aC^{t,R{t))~aCAt,L{t)). 

Next, we integrate over time, and use the continuity equation (I42p to perform a partial integration 
on the left hand term. With that we have proved that ii{t) is a weak solution according to 
Definition 11.11 

In Lemma 13.31 we have already proved that a weak solution in the sense of Definition 11.11 is 
a solution to the evolution variational inequality in distributional sense. It is important to note 
that we did not assume weak solutions to be absolutely continuous. Consequently, the result 
of uniqueness was not implied by the uniqueness of solutions to (j4ip . Combining the results 
of the paper, however, we can state that there is exactly one weak solution, and that solution 
satisfies (|tT|) . 
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A A heuristic explanation of the Wasserstein gradient flow 

We first recall the structure of a gradient fiow in Riemannian geometry. On a smooth Riemannian 
manifold M the gradient of a functional E : M R is defined by the condition 

S(p(grad£;(p), s) = E'{p) ■ s for aU s e TpM. 

Here TpM is the tangent plane at p € M, gp the local metric tensor, and E' {p) is the Frechet 
derivative (or differential) of E at p. The gradient flow of E on M is then the evolution equation 
given by 

dtp{t)^-gr&AE{p{t)), 

or equivalently 

5p(t) {dtp{t), s) - -E'{p{t)) ■ s for ah s e Tp^t)M. (43) 

Essentially, we wish to apply this abstract concept to the case of the Wasserstein metric. 
Unfortunately, this is not the metric of a smooth Riemannian manifold. Strictly speaking, the 
concept and language of gradient flows on Riemannian manifolds are therefore unavailable for this 
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metric (although various ways are known to circumvent this problem [531 El El H] ) • Despite this 
fact, the setting of Riemannian gradient flows does provide the best insight for deriving formulas 
and formulating conjectures, and we therefore choose this setting to motivate the results. 

At any fixed time i, we consider the unknown in the free boundary problem to be an element 
of the set 

M := :peL^{R'^), supp p C 1^ C M'*, ^ p = m| , 

for some fixed to > 0. If t {rt{t), p{t, •)) is a curve in M, then for any G C(R'') we have 

d_ 

dt 



(j3{x)p{t,x) dx ~ I (f>{x)p{t,x)dt^{t,x) ds{x) + / (j3{x)dtp{t,x) dx. 
n(t) Jdn{t) Jn{t} 



Here we use the notation dt^{t,x) for the (exterior) normal velocity of dft{t) at the point x. At 
any {fl, p) € M, we therefore identify the tangents to M to a pair (i;, s), where v : dVt — >■ R is the 
normal velocity of dVt and s G L^(i7) represents the change in p. The conservation of total mass 
implies that at a point (ri,/?) € M all admissible pairs (u, s) should satisfy 

p{x)v{x) ds{x) + / s{x)dx — 0. 
on Jn 

On M we define the energy function 

E{n,p) -.^ I [p\ogp + a], 

where a > is a constant, and as (formal) Riemannian metric tensor we choose 

9{n.p){{vi,si),{v2,S2)) / p{x)\/pi{x)\/p2{x)dx + / h{p{x))vi{x)v2{x) ds{x). (44) 

Jn Jon 

Here h is a positive function that will be related to / — see below — and pi and p2 are related to 
the tangents (wi,si) and (w2,S2) by 

si + div pVpi — 0, S2 + div pVp2 — in 

dpi dp2 „„ 

^r— = vi, ^— = V2 on dil. 

on on 

We claim that formally this gradient flow reduces to ([T|). To show this, let t i— ^ (ri(t),p(i, •)) 
be a path in M, satisfying at each time t the gradient- flow equation which becomes 

9(nit)^p(t)){{dtn, dtp), (V2,S2)) = -E'{n(t),p{t)) ■ {V2,S2) for aU (1^2, ^2) and t > 0. (45) 

We now choose a t > fixed. The first term in P5)) is by definition 



p{t, x)Vpi{t, x)Vp2{x) dx + / h{p{t,x))dt^{t,x)v2{x) ds{x), 
n(t) Jan{t) 



where 



dtpit, •) + div p{t, •) Vpi =0, S2 + div pVpa =0 in n{t) 

dpi „ „ ^ dp2 „„ ... 

— — = dt^{t, •), ^ = ^2 on dn[t). 

On an 

By partial integration these terms can be written as 

dtp{t,x)p2{x) dx + [ p{t,x)-^{x)p2{x) ds{x) + ( h[p{t,x))dt^{t,x)v2{x) ds(x). 
n(t) Jdn(t) on Jdn(t) 



(46) 
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On the other hand, suppressing variables x and t for readabiUty, 



E'{n,p) ■ {v2,S2) ^ / (logp+l)s2+ / {p\ogp + a)v2 
Jn Jon 



= (logp + 1) divpVp2 + / (plog/O + a)u2 
Jn Jon 

= / VpVp2- / (logp+l)p— ^+ / {plogp + a)v2 
Jn Jan on Jg^ 

[ P2^ + / (a - p)v2. 
Jan on Jq^ 



Jn 

Combining this with (|46|) and (|45]) we find 



{h{p)dt^ + a- p)v2- 

an 



By making various choices for p2 and V2 it fohows that each of the three expressions in parentheses 
is identically equal to zero. These three expressions correspond to the three equations of m, in 
the following form: 

dtp ~ Ap in f2(t) 

dp 

-TT- = -pVn on dV,{t) 
on 

p — a 

Hp) 

where we have reverted back to writing u„ for the normal velocity. Note that the function / 
from (fTc)) is forced to have a specific form, {p — a)/h{p). Since the function h is necessarily 
positive, this implies that the sign of /(p) is determined by this structure: there should be exactly 
one critical value of p at which the sign of / changes, from negative (for smaller p) to positive. 

Also note how the function / is determined by a combination of energy and metric properties: 
the sign is determined by the energy parameter a, while the rest of the rate is given by the metric 
parameter h{p). 



B Proof of Lemma 12.5 

Proof. The transport map for W2(/0°,p^) is convex on R and achieves equality in (fT5|) for 
i.e. 

hWnp",p')^ I {\x^-m)p\x)dx+ f C^y^^^*{y))p\y)dy. 

JR Jr 

The map tp is not uniquely defined on R\ supp hence we can adapt it slightly. Let [io, ^o] C M 
be the smallest interval such that supppo C We define a new convex function </? by: 

{(p{x) if <y < R° 

<piL°) + L\x - ifx<L" 
^{R°) + R^(x-R°) iix>R°. 

Because (px is uniquely determined p'^-a.e. and can be interpreted as a transport map, 

dp>{x) C[L\R^], for all a; G (L°,i?°). 
Therefore, ip is indeed convex, with conjugate p* equal to 

I oo otherwise. 
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For the second derivative of Lp* , which exists almost everywhere on (i^, i?^), we have 
^lyiy) = Vlyiy) = ^off(i,)) > fo'- all L'<y< R\ 

and therefore the function (p* is strictly convex on [L^,R^]. It follows that ip is essentially 
smooth '24', Theorem 26.3], implying that (p is differentiable on R. Since is a differentiable con- 
vex function with bounded derivative, (p^x G ^"'^(IR-), and p e W^^^(R). As ipx{Lo) = ipx{L°) = 
and Lpx{R°) = = we find L^L^ = <y9(L0) + and R^R^ = tp{R°) + ip*{R^). 

Finally wc show pT]) . First note that 

{L°-L'f = (L0)2 - 2i°Li + (Li)2 

= (L0)2 - 2(^(L°) - 2^*{L^) + {L^ f 

We then have by monotonicity 

= f(L«-Li)2+ ( {\x^~^{x))p\x)dx+ j {\y^~^*[y))p\v)dy+l-{R'-Ry 

+ I {\x^-^{x))p\x)dx+ [ i^y'-^*iy))p\y)dy 
Jr Jr 

= j{\x^-v{x))dp\x)+ jc^y^-v*{y))dp\y). 

□ 
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